Predicting robustness against transient faults of MPI based programs
نویسندگان
چکیده
The evaluation of a program’s behavior in the presence of transient faults is often a very time consuming work. In order to achieve significant data, thousands of executions are normally required and each execution will have the significant overhead of the fault injection environment. Our previously published methodology reduced significantly the time needed to evaluate the robustness of a program execution by exhaustively analyzing its basic blocks trace instead of using fault injection. In this paper we present an even forward improvement in the evaluation time of parallel programs robustness against transient faults by combining our methodology with PAS2P – a method that strives to describe an application based on its messagepassing activity. The combination of our approach and PAS2P allowed us to predict the robustness of larger parallel programs, reducing in some cases in more than 20 times the time needed to calculate the robustness while obtaining a robustness prediction error of less than 4%. Transient faults, robustness, soft errors, reliability, PAS2P
منابع مشابه
Comparison of transient ischemic dilation ratios in SPECT and SPECT-CT myocardial perfusion imaging in the low pre-test probability group
Introduction: The main purpose of this study was to compare transient ischemic dilation (TID) ratios in SPECT-low dose CT and SPECT Myocardial Perfusion Imaging (MPI) by application of different quantitative programs and quantify the possible shift in the upper normal limits of TID ratio in the SPECT-CT MPI. Methods: 149 Patients with low pre-test probability for coronary artery disease (CAD),...
متن کاملA Methodology to Calculate a Program’s Robustness against Transient Faults
Computer chips implementation technologies are evolving to obtain more performance. The side effect of such a scenario is that processors are less robust than ever against transient faults. As on-chip solutions are expensive or tend to degrade processor performance, the efforts to deal with these transient faults in higher levels are increasing. Software based fault tolerance approaches against...
متن کاملDebugging Tool for Localizing Faulty Processes in Message Passing Programs
In message passing programs, once a process terminates with an unexpected error, the terminated process can propagate the error to the rest of processes through communication dependencies, resulting in a program failure. Therefore, to locate faults, developers must identify the group of processes involved in the original error and faulty processes that activate faults. This paper presents a nov...
متن کاملRobust Model- Based Fault Detection and Isolation for V47/660kW Wind Turbine
In this paper, in order to increase the efficiency, to reduce the cost and to prevent the failures of wind turbines, which lead to an extensive break down, a robust fault diagnosis system is proposed for V47/660kW wind turbine operated in Manjil wind farm, Gilan province, Iran. According to the acquired data from Iran wind turbine industry, common faults of the wind turbine such as sensor fault...
متن کاملComparison of transient ischemic dilation ratios in SPECT and SPECT-CT myocardial perfusion imaging in the low pre-test probability group
Introduction: The main purpose of this study was to compare transient ischemic dilation (TID) ratios in SPECT-low dose CT and SPECT Myocardial Perfusion Imaging (MPI) by application of different quantitative programs and quantify the possible shift in the upper normal limits of TID ratio in the SPECT-CT MPI. Methods: 149 Patients with low pre-test probability for coronary artery disease (CAD), ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- IJCSE
دوره 12 شماره
صفحات -
تاریخ انتشار 2016